Search CORE

8 research outputs found

That is a Known Lie: Detecting Previously Fact-Checked Claims

Author: Babulkov Nikolay
Martino Giovanni Da San
Nakov Preslav
Shaar Shaden
Publication venue
Publication date: 01/01/2020
Field of study

The recent proliferation of "fake news" has triggered a number of responses, most notably the emergence of several manual fact-checking initiatives. As a result and over time, a large number of fact-checked claims have been accumulated, which increases the likelihood that a new claim in social media or a new statement by a politician might have already been fact-checked by some trusted fact-checking organization, as viral claims often come back after a while in social media, and politicians like to repeat their favorite statements, true or false, over and over again. As manual fact-checking is very time-consuming (and fully automatic fact-checking has credibility issues), it is important to try to save this effort and to avoid wasting time on claims that have already been fact-checked. Interestingly, despite the importance of the task, it has been largely ignored by the research community so far. Here, we aim to bridge this gap. In particular, we formulate the task and we discuss how it relates to, but also differs from, previous work. We further create a specialized dataset, which we release to the research community. Finally, we present learning-to-rank experiments that demonstrate sizable improvements over state-of-the-art retrieval and textual similarity approaches.Comment: detecting previously fact-checked claims, fact-checking, disinformation, fake news, social media, political debate

arXiv.org e-Print Archive

Crossref

Archivio istituzionale della ricerca - Università di Padova

Overview of CheckThat! 2020: Automatic Identification and Verification of Claims in Social Media

Author: Ali Zien Sheikh
Babulkov Nikolay
Barron-Cedeno Alberto
Elsayed Tamer
Hamdan Bayan
Haouari Fatima
Hasanain Maram
Martino Giovanni Da San
Nakov Preslav
Nikolov Alex
Shaar Shaden
Suwaileh Reem
Publication venue
Publication date: 15/07/2020
Field of study

We present an overview of the third edition of the CheckThat! Lab at CLEF 2020. The lab featured five tasks in two different languages: English and Arabic. The first four tasks compose the full pipeline of claim verification in social media: Task 1 on check-worthiness estimation, Task 2 on retrieving previously fact-checked claims, Task 3 on evidence retrieval, and Task 4 on claim verification. The lab is completed with Task 5 on check-worthiness estimation in political debates and speeches. A total of 67 teams registered to participate in the lab (up from 47 at CLEF 2019), and 23 of them actually submitted runs (compared to 14 at CLEF 2019). Most teams used deep neural networks based on BERT, LSTMs, or CNNs, and achieved sizable improvements over the baselines on all tasks. Here we describe the tasks setup, the evaluation results, and a summary of the approaches used by the participants, and we discuss some lessons learned. Last but not least, we release to the research community all datasets from the lab as well as the evaluation scripts, which should enable further research in the important tasks of check-worthiness estimation and automatic claim verification.Comment: Check-Worthiness Estimation, Fact-Checking, Veracity, Evidence-based Verification, Detecting Previously Fact-Checked Claims, Social Media Verification, Computational Journalism, COVID-1

arXiv.org e-Print Archive

Crossref

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Overview of the CLEF–2022 CheckThat! Lab on Fighting the COVID-19 Infodemic and Fake News Detection

Author: Alam Firoj
Babulkov Nikolay
Barrón-Cedeño Alberto
Caselli Tommaso
da San Martino Giovanni
Kartal Yavuz Selim
Kutlu Mucahid
Köhler Juliane
Li Chengkai
Mandl Thomas
Mubarak Hamdy
Míguez Rubén
Nakov Preslav
Nikolov Alex
Shaar Shaden
Shahi Gautam Kishore
Siegel Melanie
Struß Julia Maria
Wiegand Michael
Zaghouani Wajdi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022
Field of study

We describe the fifth edition of the CheckThat! lab, part of the 2022 Conference and Labs of the Evaluation Forum (CLEF). The lab evaluates technology supporting tasks related to factuality in multiple languages: Arabic, Bulgarian, Dutch, English, German, Spanish, and Turkish. Task 1 asks to identify relevant claims in tweets in terms of check-worthiness, verifiability, harmfullness, and attention-worthiness. Task 2 asks to detect previously fact-checked claims that could be relevant to fact-check a new claim. It targets both tweets and political debates/speeches. Task 3 asks to predict the veracity of the main claim in a news article. CheckThat! was the most popular lab at CLEF-2022 in terms of team registrations: 137 teams. More than one-third (37%) of them actually participated: 18, 7, and 26 teams submitted 210, 37, and 126 official runs for tasks 1, 2, and 3, respectively.</p

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

That is a Known Lie: Detecting Previously Fact-Checked Claims

Author: Babulkov Nikolay
Da San Martino Giovanni
Nakov Preslav
Shaar Shaden
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2020
Field of study

The recent proliferation of \u201dfake news\u201d has triggered a number of responses, most notably the emergence of several manual fact-checking initiatives. As a result and over time, a large number of fact-checked claims have been accumulated, which increases the likelihood that a new claim in social media or a new statement by a politician might have already been fact-checked by some trusted fact-checking organization, as viral claims often come back after a while in social media, and politicians like to repeat their favorite statements, true or false, over and over again. As manual fact-checking is very time-consuming (and fully automatic fact-checking has credibility issues), it is important to try to save this effort and to avoid wasting time on claims that have already been fact-checked. Interestingly, despite the importance of the task, it has been largely ignored by the research community so far. Here, we aim to bridge this gap. In particular, we formulate the task and we discuss how it relates to, but also differs from, previous work. We further create a specialized dataset, which we release to the research community. Finally, we present learning-to-rank experiments that demonstrate sizable improvements over state-of-the-art retrieval and textual similarity approaches

Crossref

Archivio istituzionale della ricerca - Università di Padova

Overview of CheckThat! 2020: Automatic Identification and Verification of Claims in Social Media

Author: Ali Zien Sheikh
Babulkov Nikolay
Barr\uf3n-Cede\uf1o Alberto
Da San Martino Giovanni
Elsayed Tamer
Hamdan Bayan
Haouari Fatima
Hasanain Maram
Nakov Preslav
Nikolov Alex
Shaar Shaden
Suwaileh Reem
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

Crossref

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

The CLEF-2022 CheckThat! Lab on Fighting the COVID-19 Infodemic and Fake News Detection

Author: Alam Firoj
Babulkov Nikolay
Barrón-Cedeño Alberto
Beltrán Javier
Caselli Tommaso
Da San Martino Giovanni
Kartal Yavuz Selim
Kutlu Mucahid
Li Chengkai
Mandl Thomas
Mubarak Hamdy
Míguez Rubén
Nakov Preslav
Nikolov Alex
Shaar Shaden
Shahi Gautam Kishore
Struß Julia Maria
Zaghouani Wajdi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022
Field of study

The fifth edition of the CheckThat! Lab is held as part of the 2022 Conference and Labs of the Evaluation Forum (CLEF). The lab evaluates technology supporting various factuality tasks in seven languages: Arabic, Bulgarian, Dutch, English, German, Spanish, and Turkish. Task 1 focuses on disinformation related to the ongoing COVID-19 infodemic and politics, and asks to predict whether a tweet is worth fact-checking, contains a verifiable factual claim, is harmful to the society, or is of interest to policy makers and why. Task 2 asks to retrieve claims that have been previously fact-checked and that could be useful to verify the claim in a tweet. Task 3 is to predict the veracity of a news article. Tasks 1 and 3 are classification problems, while Task 2 is a ranking one.</p

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Archivio istituzionale della ricerca - Università di Padova

Dissertations of the University of Groningen

The CLEF-2022 CheckThat! Lab on Fighting the COVID-19 Infodemic and Fake News Detection

Author: Alam Firoj
Babulkov Nikolay
Balog Krisztian
Barrón-Cedeño Alberto
Beltrán Javier
Caselli Tommaso
Da San Martino Giovanni
Hagen Matthias
Kartal Yavuz Selim
Kutlu Mucahid
Li Chengkai
Macdonald Craig
Mandl Thomas
Mubarak Hamdy
Míguez Rubén
Nakov Preslav
Nikolov Alex
Nørvåg Kjetil
Seifert Christin
Setty Vinay
Shaar Shaden
Shahi Gautam Kishore
Struß Julia Maria
Verberne Suzan
Zaghouani Wajdi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022
Field of study

Overview of the CLEF–2022 CheckThat! Lab on Fighting the COVID-19 Infodemic and Fake News Detection

Author: Alam Firoj
Babulkov Nikolay
Barrón-Cedeño Alberto
Barrón-Cedeño Alberto
Caselli Tommaso
da San Martino Giovanni
Da San Martino Giovanni
Degli Esposti Mirko
Faggioli Guglielmo
Ferro Nicola
Hanbury Allan
Kartal Yavuz Selim
Kutlu Mucahid
Köhler Juliane
Li Chengkai
Macdonald Craig
Mandl Thomas
Mubarak Hamdy
Míguez Rubén
Nakov Preslav
Nikolov Alex
Pasi Gabriella
Potthast Martin
Sebastiani Fabrizio
Shaar Shaden
Shahi Gautam Kishore
Siegel Melanie
Struß Julia Maria
Wiegand Michael
Zaghouani Wajdi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022
Field of study